Perceptual Spatial - Audio Coding
نویسندگان
چکیده
A novel technique for the perceptual coding of spatial audio is presented. This coding technique allows individualized 3D audio presentation and exploits the dichotomous roles of the lowfrequency interaural timing and level difference cues versus the high-frequency spectral cues in human sound localization. The high-frequency spectral cues are modified to match the acoustics of the listener’s outer ears, while preserving the original lowfrequency interaural cues. The psychoacoustic principles and theory behind the coding scheme are described and sound localization data are shown demonstrating the fidelity of the coding technique. Based on the coding technique, we develop the notion of directional frequency bands and give some basic requirements for a 3D audio recording and reproduction system.
منابع مشابه
Efficient audio compression based on the perceptual coding of spatial audio
Digital compression of audio signals is one of the technological fields that has always been tightly linked to detailed knowledge on human perception. Recent advances have extended the sources for bit-rate reduction from a predominantly masking-oriented approach towards additional exploitation of spatial perceptual irrelevancies. New techniques aim at modeling of perceptually-relevant spatial c...
متن کاملBinaural cue coding-Part II: Schemes and applications
Binaural Cue Coding (BCC) is a method for multichannel spatial rendering based on one down-mixed audio channel and side information. The companion paper (Part I) covers the psychoacoustic fundamentals of this method and outlines principles for the design of BCC schemes. The BCC analysis and synthesis methods of Part I are motivated and presented in the framework of stereophonic audio coding. Th...
متن کاملA warped linear-prediction-based subband audio coding algorithm
In this paper, a novel audio coding algorithm is proposed where the warped linear prediction (WLP) technique is employed to construct a perceptual preand post-filter for subband audio coding. A modified signal-to-mask ratio (SMR) calculation is given for subband coding of the WLP residuals of audio signals. The concept of perceptual entropy (PE) is extended to subband coding, resulting in the s...
متن کاملJND-based spatial parameter quantization of multichannel audio signals
In multichannel spatial audio coding (SAC), the accurate representations of virtual sounds and the efficient compressions of spatial parameters are the key to perfect reproduction of spatial sound effects in 3D space. Just noticeable difference (JND) characteristics of human auditory system can be used to efficiently remove spatial perceptual redundancy in the quantization of spatial parameters...
متن کاملA feasibility study regarding implementation of holographic audio rendering techniques over broadcast networks
At the present time, 5 channel surround sound has become standard for a high quality audio reproduction. In the nearest future new rendering systems with higher number of audio channels will be introduced to the consumer market. One of emerging audio rendering technologies is Wave Field Synthesis (WFS), which creates a perceptually correct spatial impression over the entire listening area. The ...
متن کامل